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Abstract: The taper functions of Kozak (1988), Bi (2000) and Fang et al. (2000) were comparatively analyzed in the present paper to de¬ 
velop a system for calculating the merchantable volume of oaks in the northwestern region of the state of Chihuahua (Mexico). Taper data 
corresponding to 298 trees were collected in mixed and uneven-aged pine-oak stands located throughout the study area, and covering the 
existing range of ages, stand densities and sites. Results show that the compatible segmented model developed by Fang et al. (2000) best 
described the experimental data and is therefore recommended for estimating tree diameter at a specific height, height to a specific diameter, 
merchantable volume, and total volume for oaks. The equation developed in this study is a fundamental tool for use in forest surveys in the 
study region and is simple enough to ensure its operational implementation. The results of the statistical analysis show that the equation can 
be recommended for other regions, although some local adaptations may be needed. 
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Introduction 

Many efforts have been made to model tree taper and commer¬ 
cial volume. During the last 30 years, studies have ranged from 
straight forward taper models (Kozak et al. 1969; Ormerod 1973; 
Hilt 1980; Lee et al. 2003; Hussain et al. 2008) to polynomial 
equations (Bruce et al. 1968; Max and Burkhart 1976; Cao et al. 
1980), hyperbolic models (Wabo et al 2002), geometric and 
trigonometric models (Parresol and Tomas 1996; Fang and Bai¬ 
ley 1999; Zhang et al. 2002; Bi 2000), and Spline functions 
(Castedo 2003). However, Newnham (1988) states that there are 
at least two reasons to continue studying this topic: 1) There is 
still no theory that adequately explains the variability in the 
shape of the stem for all trees, and 2) equations for estimating 
tree taper constitute an efficient tool for estimating both com- 
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mercial and total volume. 

Oaks ( Quercus , Fagaceae) are one of the largest and the most 
important taxonomic groups in Mexico, where almost 200 spe¬ 
cies of the 400 reported throughout the world occur (Rzedowski 
1978). The forest region in the state of Chihuahua comprises an 
area of approximately 4,598,454 hectares and is very important 
in terms of the country’s timber production. In this region, oaks 
are the second most important group of species in terms of its 
commercial value, the distribution range as well as the harvested 
timber volume (SEMARNAT 2005). Harvesting of oak species 
provides an important income in the region, where the main 
economic activity is the social forestry practice. 

Diverse species of oaks occur in La Sierra Madre Occidental 
(western Mexico) and their taxonomy is complex. Moreover, 
many of the taxa defined by Trelease (1924) as species are actu¬ 
ally only forms of several species. Problems related to classifica¬ 
tion are intensified by the presence of interspecific hybridization 
and genetic introgression between many species. In the State of 
Durango alone at least 43 species are known (Gonzalez and 
Gonzalez 1995). Foresters therefore manage oaks as a species 
group, and estimate harvestable timber volume at the genus 
level. 

Despite the importance of oaks in Mexico, these species have 
received scant attention in terms of timber production, since they 
have been considered to be of less commercial interest than pine 
species. Consequently, there are few studies regarding methods 
of determining stem standing volume (e.g. Pompa et al. 1998). 
Most studies carried out to test the suitability of different taper 
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functions for describing the stem profile of different tree species 
have been perfonned with pines (Navar and Dominguez 1997; 
Navar et al. 1997; Corral-Rivas et al. 2007). These studies are 
very useful for both forest research and forestry practice, and 
may help in attaining sustainable forest management of this tim¬ 
ber resource. To date, no studies have been conducted to develop 
merchantable volume equations for oaks in the study area. The 
objective of this research was therefore to develop a merchant¬ 
able volume system for oaks ( Quercus , Fagaceae) in the north¬ 
western region of the State of Chihuahua in Mexico. The analy¬ 
ses involved the use of tenable statistical assumptions for ac¬ 
counting for the main problems associated with the construction 
of taper functions and volume equations (multicollinearity, 
autocorrelated errors and heteroscedasticity), while examining 
the residuals associated with each position and diameter classes. 

Materials and methods 

Study site 

The study was conducted in the northwestern region of the state 
of Chihuahua (Mexico), in an area of 251 960 ha between longi¬ 
tudes of 108°15’ and 108°45’ West and latitudes of 28°45’ and 
30°00’ North (Fig. 1). The study area includes parts of the mu¬ 
nicipalities of Madera and Casas Grandes, in the high range of 
the Sierra Madre Occidental, where the topography is variable, 
with irregular elevations and depressions. The predominant for¬ 
est types are pine-oak stands, often mixed with Pseudotsuga and 
Juniperus, among other species (SEMARNAT 2005). 



Description of data 

Taper data corresponding to 298 trees were collected in mixed 
and uneven-aged stands located throughout the study area, and 
covering the existing range of ages, stand densities and sites. The 
trees were subjectively selected to ensure a representative distri¬ 
bution of diameter classes. The diameter at breast height ( D , in 
cm) and the total height ( H , in m) were measured in each tree. 
The trees were cut into logs at 1 to 3 m intervals along the stem. 
Two perpendicular diameters outside-bark were measured in 
each section, and then averaged. Log volumes were calculated in 
cubic meters, with Smalian’s formula. The top section was 
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treated as a cone. Outside-bark total stem volume (above stump) 
was obtained by summing the outside-bark log volumes and the 
volume of the top of the tree. 

Summary statistics of the final data set used in this study are 
shown in Table 1. Plots of relative height (h/H) against relative 
diameter ( d/D ) for each species are shown in Fig. 2. The width of 
the distribution of the data reflects the size and shape of the trees 
constituting the sample. 

The following notation will be used hereafter: 

D= diameter at breast height over bark (1.3 m above ground, cm); 
d= diameter over bark at height h (cm); 

H= total tree height (m); 

h= height above ground to diameter d (m); 

hst= stump height (m); 

V= total stem volume under bark from stump (m 3 ); 

v= merchantable volume under bark f m). the volume from stump to the 
point where diameter = d\ 

a,, bi, pi= coefficients to be estimated through the adjustment; 

£■=71/40000, a metric constant for converting from squared diameter, in cm 2 , 
to cross-section area, in irT; 
q= h/H 


Table 1. Summary statistics for the sampled trees 


Variable 

No. of 

Observations 

Media 

Standard 

deviation 

Maximum 

Minimum 

D (cm) 

298 

40.9 

17.1 

81.5 

8.2 

D (cm) 

1646 

25.2 

19.5 

109.5 

0 

H(m) 

298 

14.9 

5.7 

34.75 

4.0 

H (m) 

1646 

7.1 

5.6 

34.75 

0.1 



0 0.2 0.4 0.6 0.8 1.0 

h/H 


Fig. 2 Plot of relative diameter against relative height for the 298 
trees used to fit the taper equations 

Functions selected for comparison 

In the present study, the taper functions of Kozak (1988), Bi 
(2000) and Fang et al. (2000) were fitted to the data set. These 
functions provided good results in other applications and have 
shown great flexibility in fitting variable stem forms 
(Dieguez-Aranda et al. 2006; Corral-Rivas et al. 2007). The 
functions can be classified into two groups: variable-form taper 
models and segmented taper models. 

A variable-form taper model describes the stem shape with a 
changing exponent or variable from ground to top to represent 
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the neiloid, paraboloid, conic and several intermediate forms 
(Kozak 1988; Newnham 1988). This approach is based on the 
assumption that the stem fonn varies continuously along the 
length of a tree (Lee et al. 2003). In comparison with single and 
segmented taper models, this approach usually provides the low¬ 
est degree of local bias and greatest precision in taper predictions 
(e.g., Kozak 1988; Perez et al. 1990; Newnham 1992; Muhairwe 
1999; Rojo et al. 2005). However, the disadvantage is that they 
cannot be integrated analytically to calculate total stem or log 
volumes. In the present study, we analyzed the models proposed 
by Kozak (1988) and Bi (2000): 


occur; the first is close to the nonnal height (1.3 m) and the sec¬ 
ond occurs in a higher section of the stem). 

Fang et al. (2000) also derived a compatible model for mer¬ 
chantable v and total V volume by direct integration of the taper 
model. Their expressions are as follows: 


2 Tjk/h 

v=c,H m 


b x r 0 +(/j +I 2 ){b 2 -6t)jj + 

J 2 (b 3 -h^Ofo -d\-q) kll o( ]+ll O^ y 


( 4 ) 


V = a 0 D a 'H a2 


( 5 ) 


Kozak (1988) taper function: 


d = b x D h bf[ l -^L 

IW?. 

Bi (2000) taper function: 


b 4 q 2 +b 5 Ln(q+ 0,00l)+ 

*6i/? + V ,+ *sfy 


(1) 
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V 2 J 


Ln sin 




V 2 h j 


b x +b 2 sen\ —- | +b 2 cosl |+ 


3 Kq 


V 1 


+b 5 D+b 6 qpD +b- J q 2 [h 


( 2 ) 


In describing tree taper, it is generally accepted that a tree 
stem can be divided into three geometric shapes: the top is con¬ 
sidered as a cone, the central section a frustum of a paraboloid, 
and the butt a frustum of a neiloid. Segmented models describe 
these shapes by fitting each with an equation, and then mathe¬ 
matically joining the segments to produce an overall segmented 
function. From this group of segmented taper models, we ana¬ 
lyzed the compatible model developed by Fang et al. (2000). 


Fang et al. (2000) taper function: 

d = c^H^' (1 - qf^^a^a’ 2 


a 0 D a ' H“ 2 ~ klbl 

h \ ('i, - r x)+b 2 (r x -a x r 2 ) + b 2 a { r 2 ’ 

fj = b \ Ah+l1 ) b I 2 ' bi 2 ; a x = (l - p x ; 

Where , . fe-M* 

OC 2 — (1 p 2 ) b 2 b- i , 

r o = 0 - K / H ) k h '; n = (i - Pi ) k h '; 

r i =^~Pif ih2 
f/[ = 1 if p x <q < p 2 ; 0 otherwise 
[ I 2 = 1 if p 2 < q < 1; 0 otherwise 

Pi = h^H and p 2 = h 2 /H (relative heights from the ground up to 
the sections where the two union points assumed in the model 


Although the development of the compatible system of Fang 
et al. (2000) is based on Eq. (5), any other volume equation can 
be used as input into the system. 

Model fitting 

To avoid problems in the estimation of the parameters, especially 
when h = H, i.e. when d = 0, a small value lower than the meas¬ 
urement limit used in the data collection was reassigned to those 
diameters equal to zero. Similarly, a value lower than the corre¬ 
sponding height measurement limit was subtracted from the 
heights equal to the total height. A similar approach was used by 
Fang et al. (2000) in order to avoid the logarithm of zero in 
model fitting. This approach allows the use of the entire dataset 
for fitting and does not significantly change the parameter esti¬ 
mates (Fang et al. 2000; Dieguez-Aranda et al. 2006; Cor- 
ral-Rivas et al. 2007). 

Estimation of the parameters was carried out with the MODEL 
procedure of SAS/ETS® (SAS Institute Inc. 2004), in which 
several methods for parameter estimation are available. For the 
models of Kozak (1988) and Bi (2000) we used generalized 
nonlinear least squares, since the models are composed of only 
one equation. The compatible system of Fang et al. (2000) has 
two components: a taper function and a volume equation to any 
specified diameter limit, in which the total volume equation is a 
special case included in the merchantable volume equation (i.e., 
when h = H, then * _ /(ALf)y \y e used the seemingly unrelated 
regression technique to fit the system of Fang et al. (2000), in 
which the random errors of the equations are correlated, but the 
equations are not really simultaneous (none of the endogenous 
variables in one equation of the system appears as dependent on 
the left-hand side of the other equation; Zellner 1962; Judge et al. 
1988; Rose and Lynch 2001). This approach requires estimation 
of the parameters of the taper function and recovery of the im¬ 
plied total volume equation. 

Multicollinearity, autocorrelation and heteroscedasticity 

There are several problems associated with stem taper and vol¬ 
ume equation analysis that violate the fundamental least squares 
assumptions of independence and equal distribution of errors 
with zero mean and constant variance, of which multicollinearity, 
autocorrelation and heteroscedasticity are three of the most im¬ 
portant. Although the least squares estimates of regression coef- 
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ficients remain unbiased and consistent in the presence of multi¬ 
collinearity, autocorrelation and heteroscedasticity, they are no 
longer efficient (Myers 1990). These problems may seriously 
affect the standard errors of the coefficients, thereby invalidating 
statistical tests that use t or F distributions and confidence inter¬ 
vals (Neter et al. 1990, p.300). Thus, appropriate statistical pro¬ 
cedures should be used in model fitting to avoid problems asso¬ 
ciated with heteroscedasticity and autocorrelated errors, and 
models with low multicollinearity should be selected whenever 
possible (Kozak 1997). 

To evaluate the presence of multicollinearity among variables 
in the models analyzed, we used the condition number, which is 
defined as the square root of the ratio of the largest to the small¬ 
est eigenvalue of the correlation matrix. According to Belsey 
(1991), if the condition number is between 5 and 10, collinearity 
is not a major problem, if it is in the range of 30-100, then there 
are problems associated with collinearity, and if it is in the range 
1 000-3 000 there are severe problems associated with collinear¬ 
ity. Since the database contains multiple observations for each 
tree (i.e., hierarchical data), it is reasonable to expect that the 
observations within each tree are spatially correlated, which 
violates the assumption of independent error terms. We used a 
continuous-time autoregressive error structure (CAR(x)) to ac¬ 
count for the inherent autocorrelation of the data. This error 
structure enables the model to be applied to irregularly spaced, 
unbalanced data (Zimmerman and Nunez-Anton 2001; 
Dieguez-Aranda et al. 2006; Corral-Rivas et al. 2007). To test for 
the presence of autocorrelation and the order of the CAR(x) to be 
used, graphs of residuals plotted against lag-residuals from pre¬ 
vious observations within each tree were examined visually. 
Appropriate fits for the models with correlated errors were made 
by including the CAR(x) error structure in the MODEL proce¬ 
dure of SAS/ETS®system (SAS Institute Inc. 2004), which al¬ 
lows for dynamic updating of the residuals. 

Model comparison 


RMSE = ^(y i -y l ) 2 Kn-p ) (9) 

AAIC-H-lna 2 +2-(/; + i)-minfrlna 2 +2(/H-l)) 

with c 2 =2>,.-j>,.) 2 /n (10) 

j=i 

where f yy is ti le coefficient of correlation between the meas¬ 
ured ( y j) and estimated ( y ) values of the dependent variable, n is 

the total number of observations used to fit the model, p is the 
number of model parameters, and a 2 the estimator of the vari¬ 
ance of the model error. 

Because the quality of fit does not necessarily reflect the qual¬ 
ity of future prediction (Myers 1990, p. 168), assessment of the 
validity of the model with an independent data set is recom¬ 
mended (Kozak and Kozak 2003). Due to the scarcity of such 
data, several methods have been proposed (e.g., splitting the data 
set or cross-validation, double cross-validation), although they 
seldom provide any additional information compared with the 
respective statistics obtained directly from models built from 
entire data sets (Kozak and Kozak 2003). Thus, because deci¬ 
sions have to be made with available information, it is better to 
wait to obtain new data before such validation is carried out 
(Dieguez-Aranda et al. 2006; Corral-Rivas et al. 2007). 

The taper functions were also assessed by use of box plots for 
(/residuals by position (percent relative height points along the 
stem, i.e., 5%, 15%, 25%, and so on up to 95%). We did the 
same for h residuals by d classes and for V residuals by D classes. 
These graphs, calculated by position or diameter class, are very 
important for showing areas or diameter classes for which the 
functions provide especially poor or good predictions. 

Results and discussion 


The criteria used for judging the performance of the taper func¬ 
tions were based on numerical and graphical analyses of the 
residuals. Three goodness-of-fit statistics were examined: bias 
(E ), coefficient of detennination for nonlinear regression (R~) 
(see Ryan 1997, pp. 419 and 424), root mean square error ( RMSE ) 
and the Akaike information criterion in differences (A AIC), 
which is an index used to select the best model and is based on 
minimizing the distance of Kullback-Liebler. Although there are 
several shortcomings associated with the use of the R 2 in nonlin¬ 
ear regression, the general usefulness of some global measure of 
model adequacy would appear to override some of these limita¬ 
tions (Ryan 1997, p. 424). The expressions of these fitting statis¬ 
tics are summarized as follows: 



Initially, the models were fitted by use of nonlinear least squares 
without expanding the error terns to account for autocorrelation. 
A trend in residuals as a function of lagl- and lag2-residuals 
within the same tree was apparent in all the models analyzed, as 
expected because of the longitudinal nature of the data used for 
model fitting. Figure 3 (first row) provides an example of this 
with the model of Fang et al. (2000). After correcting for auto¬ 
correlation with a second-order continuous-time autoregressive 
error structure, the trends in the residuals disappeared (Fig. 3 
third row). No higher-order autocorrelation was observed (Fig. 3, 
third column). The estimations provided by the model were not 
significantly different from the models fitted without considering 
such correction. The sole purpose of autocorrelation correction 
was to improve interpretation of the statistical properties of the 
model, and it has no practical application unless several meas¬ 
urements of diameter at different heights are considered in the 
same individual tree. 
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Fig. 3 Graphs of d 
residuals plotted 

against: 

Lagl-Residuals (left 
column), 

Lag2-Residuals (mid¬ 
dle column), and 
Lag3-Residuals (right 
column) for the model 
of Fang et al. (2000) 
fitted without consid¬ 
ering the autocorrela¬ 
tion parameters (first 
row), and with con¬ 
tinuous-time first and 
second order autore¬ 
gressive error struc¬ 
tures (second and third 
rows, respectively) 


Table 2. Parameter estimates, approximated standard errors, good- 
ness-of-fit statistics and condition number of the models analyzed 


Model Parameter Estimate 


Standard Goodness-of-fit statistics Condition 
error R : RMSE E AAIC number 


b, 1.5584 0.2024 

b 2 0.7836 0.0466 


Kozak 

(1988) 

b 3 

b 4 

b j 

b 6 

by 

b s 

1.0020 

0.3885 

-0.3027 

2.5502 

-0.9362 

0.0145 

0.0011 

0 1170 0.94 
0.0276 

0.3143 

0.1575 

0.0044 

4.73 -0.17 

106.12 

130.71 


b, 

4.0586 

0.1365 





b 2 

-0.7308 

0.0456 




Bi 

b 3 

n.s 





(2000) 

b 4 

-2.3205 

0.0844 0.93 

5.44 0.24 

636.71 

97.66 


bs 

n.s 






b 6 

0.0298 

0.0096 





by 

-0.058 

0.0139 





a 0 

0.000076 6.71x10 -6 





aj 

1.8172 

0.026 





a.2 

1.0014 

0.0246 




Fang 

bi 

6.27*10" 6 

3.96xl0' 7 




et al. 



, 0.95 

4.59 -0.04 

0 

35.75 


b 2 

3.1x10" 6 

4.68xl0‘ 7 




(2000) 








b 3 

3.9* 10' 5 

1.7x10-6 





Pi 

0.0556 

0.0031 





P2 

0.5992 

0.0331 





n.s = not significant 


Table 2 shows the parameter estimates and their corresponding 
approximated standard errors along with the goodness-of-fit 
statistics. All the parameters were found to be significant at the 
1% significance level, except for parameters bj and b 5 in the 
model of Bi (2000), which were excluded from the model be¬ 


cause the null hypothesis could not be rejected, which suggests 
that the values are equal to zero. The multicollinearity of the 
models was moderate, as inferred from the condition numbers, 
which had values of 37.75, 97.66, and 130.71 for the models of 
Fang et al. (2000), Bi (2000) and Kozak (1988), respectively. All 
models accounted for more than 93% of the total variance of d 
along the stem, particularly the function of Fang et al. (2000), 
which explained more than 95% of this variability (see the R 2 
values in Table 2). The values of the mean square error were 4.7, 
5.4 cm, and 4.6, for the models of Kozak (1988), Bi (2000), and 
Fang et al. (2000), respectively. According to these good- 
ness-of-fit statistics, the model of Fang et al. (2000) best de¬ 
scribed the data for diameter to a specific height. Moreover this 
function showed the smallest multicollinearity as inferred from 
the condition number. 

The box plots of d, and h residuals against relative heights and 
D classes, respectively (Fig. 4) showed a non-homogeneous dis¬ 
tribution of the residuals for the variables d and h for the model 
of Fang et al. (2000). Fiowever, some unsatisfactory estimations 
were observed in all zones, although the mean was not skewed 
and the 25% and 75% quartiles of the distribution of residuals 
were very close to zero. At the bottom of the stem, where most 
of the volume is accumulated and where greater irregularities are 
usually present, especially in oaks, the residuals in diameter es¬ 
timation are higher. This is caused by the lack of observations 
below a height of 0.5 m from the stump, which would improve 
the fit in this zone. 

In summary, the model of Fang et al. (2000) best described the 
data for both diameter along the stem and height to a specific 
diameter. Thus, the compatible volume system of Fang et al. 
(2000) is proposed as the most suitable for describing the stem 
profile and predicting stem volume of oaks in the northwest of 
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the State of Chihuahua. 
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Fig. 4 Box plots of d residuals (Y-axis, in cm) against relative height 
classes (X-axis, in percent) (left) and of h residuals (Y-axis, in m) 
against D classes (X-axis, in cm) (right) for the model of Fang et al. 
(2000). The plus signs represent the mean of prediction errors for the 
corresponding relative height and diameter at breast height classes, re¬ 
spectively. The boxes represent the interquartile ranges. The maximum 
and minimum diameter under bark and height prediction errors are rep¬ 
resented by the small horizontal lines at the upper and lower ends of the 
vertical lines, respectively 

Comparison of the taper function for different diameter classes 

One of the main objectives of this type of study is to obtain taper 
functions that are valid for a wide range of site indexes, densities 
and characteristics of the trees in a stand. Good discussions of 
the factors affecting the shape and profile of the trees are pro¬ 
vided by Larson (1963) and LeMay et al. (1991). There are many 
studies in which attempts have been made to relate the parame¬ 
ters of taper functions to characteristics of the site index or to the 
size of the tree. For example, Morris and Forslund (1992) ob¬ 
served that the microsite and climatic variables explained 61.9% 
and 38.7% of the variation in the taper function and the tree 
shape, respectively, with the model of Forslund applied to Pinus 




, 1 


]l 1 


10152025303 5404550556065707580 
Diameter class 


banksiana. Muhairwe et al. (1994) found that inclusion of the 
age, site index and/or variables related to the tree crown in the 
model of Kozak (1988), slightly improved the estimations ob¬ 
tained for Pseudotsuga menziesii, Populus tremuloides and Thuja 
plicata. 

Research efforts directed at developing taper functions that 
accounted for the effect that these factors have on the differences 
observed in the tape along the tree stem gave rise to, amongst 
others, the complete variable exponent models (e.g. Kozak 1988 
or Bi 2000) or segmented models (Fang et al. 2000). 

The size of a tree is another factor that has great effect in de¬ 
termining the taper of the tree. Bi and Hamilton (1998) observed 
the importance of the dimensions of the tree to its profile. More 
recently, in a study with several species of Eucalyptus, Bi (2000) 
showed that tree size affected the value of the inflexion point of 
its estimated profile function, which made him consider that the 
models that force the use of a common inflexion point for all 
trees are less flexible. Similar results were obtained by Bi and 
Long (2001) for Pinus radiata. 

In the model of Fang et al. (2000), two inflexion points (p; and 
pj), which are common to all of the trees, were used. The authors 
consider that two inflexion points are sufficient for most species, 
in detail, the first would be located close to the diameter at breast 
height (1.30 m) and the second one would be further up the stem. 
We therefore proceeded to analyze the effect of tree size, repre¬ 
sented by the diameter class, at such inflexion points. 

The first step was to group the trees in diameter classes of 10 
cm. The number of trees in each diameter class, as well as the 
mean, maximum and minimum values and the typical deviations 
in the heights and normal diameters are shown in Table 3. The 
number of trees in the higher diameter classes is very small and 
therefore all the trees with diameter larger than 60 cm were 
grouped in a single class. 


Table 3. Mean, minimum and maximum values, typical deviations in heights and diameters, and number of trees in each diameter class 


Class interval 
(cm) 

Trees 

(No.) 



Diameter (cm) 




Heights (m) 


Mean 

Standard 

Deviation 

Minimum 

Maximum 

Mean 

Standard 

Deviation 

Minimum 

Maximum 

<20 

57 

14.86 

3.0082 

8.20 

19.60 

8.71 

3.1455 

4.07 

18.72 

20-30 

59 

25.03 

3.0684 

20.00 

29.70 

10.53 

3.7542 

4.61 

18.83 

30-40 

56 

34.93 

2.9821 

30.00 

39.90 

13.60 

4.1000 

5.34 

22.80 

40-50 

46 

45.01 

2.8617 

40.00 

49.80 

15.36 

3.6282 

7.74 

21.95 

50-60 

47 

54.81 

2.8343 

50.00 

59.40 

15.58 

5.5831 

5.31 

25.69 

60-70 

25 

63.22 

2.3056 

60.00 

68.70 

16.44 

5.8764 

8.10 

28.26 

70-80 

5 

74.46 

3.2145 

70.00 

77.50 

22.15 

6.5013 

13.72 

30.60 

80-90 

3 

81.23 

0.2517 

81.00 

81.50 

29.21 

7.5715 

20.58 

34.75 


To assess whether the taper equation differs among diameter 
classes, the nonlinear extra sum of squares method was used 
(Bates and Watts 1988, pp. 103-104). This method requires the 
fitting of full and reduced models, and has frequently been ap¬ 
plied to assess whether separate models are necessary for differ¬ 
ent species or different geographic regions (e.g., Huang et al. 
2000; Zhang et al. 2002; Corral-Rivas et al. 2004; Corral-Rivas 
et al. 2005; Castedo et al. 2005). In this case, the reduced model 
corresponds to the same set of global parameters for all diameter 


classes, while the full model corresponds to different sets of 
global parameters for the inflexion points (p; and P 2 ) of the 
model in each diameter class. The foil model is obtained by ex¬ 
panding each global parameter by including an associated pa¬ 
rameter and a dummy variable to differentiate the diameter 
classes. 

Pi+Ctkh * = U k = 2,...,6 (ID 
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where p t is one of the inflexion points of the model of Fang et al. 
(2000), c,vt is a parameter associated with the inflexion point i and 
the diameter class k in the full model, and /*. is a dummy variable 
whose value is equal to 1 for the diameter class k and 0 for the 
other classes. The appropriate test statistic uses the following 
expression: 


F* = 


'SSE R -SSE f \ df F 
, df R ~df F )~SSeT f 


( 12 ) 


where SSE l is the error sum of squares of the reduced model, 
SSE F is the error sum of squares of the full model, and df R and 
df F are the degrees of freedom of the foil and reduced models, 
respectively. The non-linear extra sum of squares follows an 
F'-distribution. 


Pi 2 ; see Table 4). However, the size of the trees, represented by 
the diameter classes, has no effect on the position of the second 
inflexion point along the stem, since none of the estimators of the 
parameters associated with each diameter class were significant 
(P 22 , ■■■,P 26 ), i.e., all the diameter classes can be well represented 
with a single value (p 2I ). 

The autocorrelation for the full model was also corrected by 
use of a second-order continuous-time autoregressive error 
structure and as in the reduced model, the trends in residuals 
disappeared. The box plots of d, and h residuals against relative 
heights and D classes, respectively were very similar to those 
calculated for the reduced model (see Fig. 4) and showed a 
non-homogeneous distribution of the residuals for the variables d 
and h. 

Conclusions 


Table 4. Parameter estimates, approximated standard errors, good- 
ness-of-fit statistics and condition number of the model of Fang et al. 
(2000), fitted with the same inflexion points for each diameter class 
(reduced model) and with a different inflexion point per diameter 
class (full model) 


Model 


pa ti 
to B 
a. 


Goodness-of-fit statistics Con- 


Standard p 2 
Parameter Estimate R 

error 


a 0 

0.000076 6.71xl0‘ 6 

ai 

1.8172 

0.026 

a 2 

1.0014 

0.0246 

bi 

6.27xl0‘ 6 

3 - 96Xl0 ‘; 0.95 

b 2 

3.1xl0" 6 

4.68xl0" 7 

b 3 

3.9xl0‘ 5 

1.7xl0‘ 6 

Pi 

0.0556 

0.0031 

P2 

0.5992 

0.0331 


a 0 

0.000063 

6.57xl0‘ 6 

aj 

1.8675 

0.0301 

a 2 

0.9944 

0.0252 

bi 

6.08xl0‘ 6 

4.04xl0‘ 7 

b 2 

3.1xl0" 5 

4.71xl0‘ 7 

b 3 

Pn 

3.8xl0‘ 5 

0.0667 

1.69xl0‘ 6 

0.95 

0.0046 

PI3 

-0.0118 

0.0035 

Pl4 

-0.0122 

0.0034 

Pis 

-0.0182 

0.0035 

Pl6 

-0.0108 

0.0035 

P21 

0.5975 

0.0341 


RMSE E AAIC dition 
Num¬ 
ber 


4.59 -0.04 28.53 35.75 


4.56 -0.00 0 45.01 


Among the three taper functions assessed for describing the taper 
for oaks species in the northwestern region of the State of Chi¬ 
huahua (Mexico), the segmented compatible model of Fang et al. 
(2000) best described the experimental data. This model per¬ 
formed well in predicting upper-stem diameter, height to any 
specific diameter and total volume. The system of Fang et al. 
(2000) also has the advantage that the resulting taper, merchant¬ 
able and total volume equations are compatible with each other. 
The non-linear extra sum of squares method indicated significant 
differences in the estimator of the inferior inflexion point in the 
stem profile (i.e. that located around the breast height), but not 
for the second inflexion point. In this sense, Hernandez (2004) 
obtained results indicating that the inflexion points among the 
stem shapes decrease over time, in other words, taper in early 
growth classes differs from the taper presented by older indi¬ 
viduals. Therefore, separation of diameter classes is recom¬ 
mended for estimating upper-stem diameter, height to any spe¬ 
cific diameter and total volume, especially because Mexican 
oaks usually have unstable stem shapes. In this respect, Castedo 
(2003) pointed out that such modelling requires further research, 
since the exact number of inflexion points is still unknown. 
Spline functions may be useful for such purposes. The equation 
developed in the present study is a fundamental tool for applica¬ 
tion in forest surveys in the study region; it is simple enough to 
ensure its operational implementation. Moreover the statistical 
shows that its implementation in other similar regions can be 
recommended, although some local adaptations may be needed. 


Results of the fitting process for full and reduced fonns of the 
model of Fang et al. (2000) are shown in Table 4. The F statistic 
calculated by Eq. (12) was 4.3, and the probability of finding a 
critical value greater than 4.3 (i.e., F critical (1-a; dfR - df F ) > 4.3) 
was lower than 0.01. Differences among the taper equations were 
therefore observed for different diameter classes. This means that 
the full model better represents the general tendency of the sam¬ 
ple trees. The diameter classes have an important effect on the 
value of the first inflexion point, i.e., the inflexion point located 
around the diameter at breast height (all the parameters associ¬ 
ated with each diameter class are significant except for parameter 
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